QUEMDISSE? Reported speech in Portuguese
Abstract
This paper presents corpus-based work on direct and indirect speech in Portuguese. We report on a study whose aim was to identify (i) the Portuguese verbs used to introduce reported speech and (ii) the syntactic patterns used to convey it, in order to improve the performance of a quotation extraction system dubbed QUEMDISSE?. In addition, (iii) we present a Portuguese corpus annotated with reported speech using the lexicon and rules provided by (i) and (ii), and discuss the annotation process and what was learned.
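The abstract does not give the actual lexicon or rules, so the following is only a minimal sketch of how a verb lexicon (i) and syntactic patterns (ii) might be combined for quotation extraction. The verb list, the two regular expressions, and the function name `extract_quotes` are all illustrative assumptions, not the QUEMDISSE? implementation.

```python
import re

# Hypothetical mini-lexicon of Portuguese reporting verbs (assumption:
# the study's real lexicon is not listed in the abstract).
REPORTING_VERBS = ["disse", "afirmou", "declarou", "explicou", "acrescentou"]
VERB = "|".join(REPORTING_VERBS)

# Assumed pattern A (direct speech): "quote", VERB speaker
DIRECT = re.compile(
    r'"(?P<quote>[^"]+)"\s*,?\s*(?P<verb>' + VERB + r')\s+(?P<speaker>[^.,;]+)'
)

# Assumed pattern B (indirect speech): Speaker VERB que content
INDIRECT = re.compile(
    r'(?P<speaker>[A-ZÀ-Ú][^.,;"]*?)\s+(?P<verb>' + VERB + r')\s+que\s+(?P<quote>[^.]+)'
)


def extract_quotes(sentence):
    """Return one dict per match with the kind, speaker, verb and quote."""
    results = []
    for kind, pattern in (("direct", DIRECT), ("indirect", INDIRECT)):
        for m in pattern.finditer(sentence):
            results.append({
                "kind": kind,
                "speaker": m.group("speaker").strip(),
                "verb": m.group("verb"),
                "quote": m.group("quote").strip(),
            })
    return results


if __name__ == "__main__":
    print(extract_quotes('"Vamos ganhar", disse o treinador.'))
    print(extract_quotes("O ministro afirmou que a economia vai crescer."))
```

A real system would likely work over parsed or tagged text rather than raw regular expressions; the sketch only shows how a lexicon can parameterise the patterns.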
Similar resources
Finite-state transducer inference for a speech-input Portuguese-to-English machine translation system
Statistical techniques and grammatical inference have been used for dealing with automatic speech recognition with success, and can also be used for speech-to-speech machine translation. In this paper, new advances on a method for finite-state transducer inference are presented. This method has been tested experimentally in a speech-input translation task using a recognizer that allows a flexib...
Automatically Recognising European Portuguese Children's Speech - Pronunciation Patterns Revealed by an Analysis of ASR Errors
This paper reports findings from an analysis of errors made by an automatic speech recogniser trained and tested with 3-10-year-old European Portuguese children's speech. We expected and were able to identify frequent pronunciation error patterns in the children's speech. Furthermore, we were able to correlate some of these pronunciation error patterns and automatic speech recognition errors. T...
Analysis of error type and frequency in apraxia of speech among Portuguese speakers
Most studies characterizing errors in the speech of patients with apraxia involve the English language. Objectives: To analyze the types and frequency of errors produced by patients with apraxia of speech whose mother tongue was Brazilian Portuguese. Methods: 20 adults with apraxia of speech caused by stroke were assessed. The types of error committed by patients were analyzed both quantitatively...
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
C-ORAL-BRASIL I is a Brazilian Portuguese spontaneous speech corpus compiled following the same architecture adopted by the C-ORAL-ROM resource. The main goal is the documentation of the diaphasic and diastratic variations in Brazilian Portuguese. The diatopic variety represented is that of the metropolitan area of Belo Horizonte, capital city of Minas Gerais. Even though it was not a primary g...
Fluency profile: comparison between Brazilian and European Portuguese speakers.
The purpose of the study was to compare the speech fluency of Brazilian Portuguese speakers with that of European Portuguese speakers. The study participants were 76 individuals of any ethnicity or skin color aged 18-29 years. Of the participants, 38 lived in Brazil and 38 in Portugal. Speech samples from all participants were obtained and analyzed according to the variables of typology and fre...